Aesthetic Critiques Generation for Photos: Supplementary Material
نویسندگان
چکیده
As mentioned in our paper, we study a new problem, aesthetic critiques generation, which is different from conventional image captioning. Many recent works [1, 2] started to argued that conventional evaluation criteria (BLEU, METEOR and CIDEr) borrowed from machine translation community are unsuitable for image captioning task. How to choose a suitable criterion is still a tricky problem in image captioning, and this issue is more significant in our proposed new problem. In this section, we provide the results of our approaches on conventional automatic evaluation criteria. Then we give an example to explain why these criteria are improper. Compared to them, SPICE suits better to our task, although human evaluation could be regarded as a more reliable measure. The results of our approaches on BLEU, METEOR, and CIDEr are presented in Table 1. Compared to recent captioning works, the values shown in Table 1 are quite low. In the beginning, we are curious about why the automatic evaluation results are different from human’s (as shown in Section 5.2 of our main paper). So, we start to study whether these automatic criteria are reasonable. First, let us note that the ground truths of common image captioning and our PCCD datasets are different. In the former (e.g., MSCOCO and Flickr30k), an image is described by using multiple similar sentences, and thus the ground-truth captions are near-duplicates for the same image. In PCCD, an image is described by multiple sentences that are dissimilar (or not synonymous), and thus the ground-truth captions are
منابع مشابه
Supplementary Material : A Richly Annotated Catalog of Surface Appearance
In this supplementary material, we provide further details for each of our Amazon Mechanical Turk (MTurk) tasks, and on our implementation. The data from all of our experiments is fully open and available on-line at http://opensurfaces.cs.cornell.edu/. Statistics about the experiments and the results are also hosted online, and they will be continually updated as the database grows. Task descri...
متن کاملPhoto Aesthetic/Quality Evaluation and Editing
The problems of photo aesthetic and photo quality are similar, but not exactly the same. In photo aesthetic assessment, we seek photos that looks beautiful to people. The aesthetic score of a photo can be affected by the photo’s color tone, composition, layout, and etc. In photo quality assessment, we seek photos that are in high quality. The quality score of a photo can be affected by shake, b...
متن کاملAesthetic Quality Assessment of Photographic Images: A Literature Survey
Aesthetics is a branch of philosophy which deals with the study of emotions in relation to the sense of beauty. Nowadays, there is a tremendous increase in the use of digital images as a means for representing and communicating information. With the considerable increase of consumer photos, evaluating the quality of photos has become a difficult task. People are more interested in photos that a...
متن کاملAutomatic Aesthetic Photo-Rating System
Growing prevalence of smartphone makes photography easier than ever. However, the quality of photos varies widely. Because judging the aesthetic of photos is based on several "rule-of-thumb", it remains difficult for computers to rate photos without manual intervention. In this work, we utilize aesthetic features of photos and machine learning techniques to automatically distinguish good photos...
متن کاملThe Visual Turing Test for Scene Reconstruction Supplementary Material
We show additional typical good and bad image results in Figures 3 and 4. Even with good geometry rendering, subjects are still more likely to choose the reference photo if people are present (e.g., the first example in Figure 4). Figure 6(b) shows the increase in performance if we omit photos containing people; observe that the scores are sigFigure 5. Black and white photos are anomalous (left...
متن کامل